Information Theoretic and Statistical Drive Sanitization Models

نویسندگان

  • Jeffrey Medsger
  • Avinash Srinivasan
  • Jie Wu
چکیده

Current enterprise drive sanitization techniques employ little or no intelligence to determine if the area being sanitized, with data overwriting, actually contains sensitive resident data. All data blocks in the target area are sanitized, utilizing bruteforce sanitization techniques of one to several wipe passes. In reality, a significant number of drives needing sanitization may contain areas with no sensitive data, or even any data for that matter. Consequently, sanitizing such areas is counter-intuitive and counter-productive. In this paper, we propose two information-theoretic techniques – ERASE and ERASERS, which utilize an entropy measurement of data blocks for quick and effective drive sanitization. Our first technique, ERASE, unlike current brute-force methods, computes the entropy of each data block in the target area. Subsequently, all data blocks, which have an entropy within the user-specified sensitivity range, are wiped. Our second technique, ERASERS, which is an extension of ERASE, employs random sampling to enhance the speed performance of ERASE. To achieve this, ERASERS divides the target area into subpopulations, performs random sampling of blocks from each subpopulation, and computes the entropy of each sampled block. If the entropy of any sampled block, within a subpopulation, is within the user-specified sensitive entropy range, the entire subpopulation is wiped. The random sampling component of ERASERS gives organizations an alternative for a faster wipe, compared to the currently employed brute-force sanitization techniques. We have presented results, which compare the performance of our proposed techniques against the current brute-force technique. In a test, performed on the HFS+ unallocated space of an Apple MacBook Pro, used under real-world conditions, ERASERS averaged a speed improvement of 50.47% over a brute force technique, while retaining an accuracy of 99.84%, when set to a greater than 0 bpB and less than or equal to 8 bpB entropy range.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Reliably Erasing Data from Flash-Based Solid State Drives

Reliably erasing data from storage media (sanitizing the media) is a critical component of secure data management. While sanitizing entire disks and individual files is well-understood for hard drives, flash-based solid state disks have a very different internal architecture, so it is unclear whether hard drive techniques will work for SSDs as well. We empirically evaluate the effectiveness of ...

متن کامل

SAFE: Fast, Verifiable Sanitization for SSDs

As users, corporations, and government agencies store more data in digital media, managing that data and access to it becomes increasingly important. Reliably removing data from persistent storage (i.e., sanitizing the storage) is an essential aspect of this management process, and several techniques that reliably delete data from hard disks are available as built-in ATA or SCSI commands, softw...

متن کامل

A Novel Sanitization Approach for Privacy Preserving Utility Itemset Mining

Data mining plays a vital role in today’s information world wherein it has been widely applied in various business organizations. The current trend in business collaboration demands the need to share data or mined results to gain mutual benefit. However it has also raised a potential threat of revealing sensitive information when releasing data. Data sanitization is the process to conceal the s...

متن کامل

A Systematic Analysis of XSS Sanitization in Web Application Frameworks

While most research on XSS defense has focused on techniques for securing existing applications and re-architecting browser mechanisms, sanitization remains the industry-standard defense mechanism. By streamlining and automating XSS sanitization, web application frameworks stand in a good position to stop XSS but have received little research attention. In order to drive research on web framewo...

متن کامل

Detecting Sensitive Information from Textual Documents: An Information-Theoretic Approach

Whenever a document containing sensitive information needs to be made public, privacy-preserving measures should be implemented. Document sanitization aims at detecting sensitive pieces of information in text, which are removed or hidden prior publication. Even though methods detecting sensitive structured information like e-mails, dates or social security numbers, or domain specific data like ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015